The amendments to claims 38, 39, and 41 are supported by the present specification. 
For example, exemplary support for the term "fraction" is found in Example 12 and is shown 
in Figure 12. Additionally, support for "cell free medium" is found on page 34, line 10, and 
page 38, line 3. 

Because the foregoing amendments do not introduce new matter, entry thereof by the 
Examiner is respectfully requested. 

II. Summary of the Invention 

The present invention is directed to cell free mediums comprising a fusion protein of 
hPTH (1-84) and a leader sequence, and a method for obtaining intact hPTH (1-84) using 
such a cell free medium. 

PTH, which is an important regulator of calcium metabolism in mammals, is related 
to several mammalian diseases, such as milk fever, acute hypocalcemia, and otherwise 
pathologically altered blood calcium levels. See page 2, lines 15-19, of the application. 
Through its action on target cells in bone and kidney tubuli, PTH increases serum calcium 
and decreases serum phosphate, while opposite effects are found regarding urinary excretion 
of calcium and phosphate. See page 5, lines 27-31, of the application. PTH is useful, for 
example, as a component of a diagnostic kit or as a therapeutic in human and veterinary 
medicine. See page 2, lines 20-22, of the specification. 

Prior to the present invention, PTH was commercially available only in very small 
quantities at high cost, partly because synthesis of the compound was difficult and complex. 
See page 1, lines 33-38, of the specification. Applicants have overcome the problems of the 
prior art and discovered processes of preparing recombinant PTH using microorganisms. 



III. Issues Under Objections 

The Examiner objected to the specification for allegedly failing to provide antecedent 
basis in the specification for the recitation of "greater than 90%" purity of the claimed 
protein. Applicants note that the present specification provides antecedent basis for this 
recitation and direct the Examiner's attention to page 7, lines 30-32, where it states: "The 
sequence analysis indicated that the recombinant hPTH was more than 90 percent pure." 
Therefore, Applicants respectfully request reconsideration and withdrawal of the objection. 

The Examiner also objected to the specification because the information on page 1, 
lines 20-23, appears to be duplicative of the first paragraph of the same page. Applicants 
have amended the specification by deleting lines 20-23 on page 1. 



IV. Claim Rejections - 35 U.S.C. § 112, First Paragraph 

Claims 34 and 52 were rejected by the Examiner under 35 U.S.C. § 112, first paragraph, 
for alleged lack of enablement. Applicants have canceled claims 34 and 52, thus obviating 
the rejection of these claims. The cancellation of claims does not constitute acquiescence in 
the propriety of any rejection set forth by the Examiner. 

Claims 24-29, 34-47, and 52-56 were rejected by the Examiner under 35 U.S.C. § 112, 
first paragraph, for alleged lack of enablement. The Examiner asserted that the specification 
provides enablement for Saccharomyces mating factor alpha-1 leader sequence for use in yeast; 
however, the specification allegedly does not provide enablement for all leader sequences for 
use in all microorganisms. Applicants respectfully request reconsideration and withdrawal of 
the rejection. 

Applicants have canceled claims 24-29, 34-37, 40, 42-47, and 52-56, thus obviating 
the rejection of these claims. The cancellation of claims does not constitute acquiescence in 
the propriety of any rejection set forth by the Examiner. 

A. Numerous Signal Sequences Were Characterized and Well Known in the Art at the Time the Claimed Invention Was Made 

With respect to claims 38, 39, and 41, the attached articles demonstrate that the use of 
various leader sequences in various microorganisms was well known in the art at the time the 
claimed invention was made. For example, von Heijne, "Patterns of Amino Acids Near 
Signal-Sequence Cleavage Sites," Eur. J. Biochem., 133:17-21 (1983) (Exhibit 1), teaches 
that the patterning of amino acids near the cleavage site between a signal sequence and a 
mature protein "is much richer than hitherto believed, and that, given this information, 
cleavage sites can be predicted quite successfully." Abstract of von Heijne. This reference 
presents an analysis of a comprehensive collection of known eukaryotic signal sequences, 
aligned with coincident cleavage sites (Table 1). 

In addition, von Heijne, "Signal Sequences: The Limits of Variation," J. Mol. Biol., 
184:99-105 (1985) (Exhibit 2), teaches mapping of a large sample of N-terminal and C-terminal 
regions of signal sequences, including eukaryotic and prokaryotic signal sequences. This 
reference further teaches that "[a]ll signal sequences seem to be built along the same general 
lines . . . ." Col. 1, page 99. The analysis of eukaryotic and prokaryotic signal sequences 
shows subtle differences between the two types, and suggests "minimal" requirements to 
which a fully functional signal sequence must conform. Id. 

Moreover, Watson, "Compilation of Published Signal Sequences," Nucleic Acids Res., 
12(13):5145-5164 (1984) (Exhibit 3), teaches that the structure, function, and processing of 
signal sequences have been reviewed (page 5146), and follows with a compilation of published 
signal sequences (pp. 5146-5157). The proteins listed are grouped into genera, the proteins are 
classified as inner membrane, outer membrane, periplasmic, or transmembrane, where 
appropriate, and the first ten amino acid residues of the protein sequences are also given. 

About 189 eukaryotic signal sequences are listed, including sequences from baboon, 
bovine, canine, hamster, human, monkey, murine, ovine, porcine, rabbit, rat, chicken, Caiman 
crocodylus, Xenopus, angler fish, carp, catfish, hagfish, salmon, torpedo, winter flounder, bee, 
Drosophila melanogaster, Brassica napus, Hordeum vulgare, Phaseolus vulgaris, Pisum 
sativum, Zea mays, Saccharomyces, Plasmodium knowlesi, and Trypanosoma brucei. Over 
24 viral signal sequences are listed, including sequences from adenovirus 2, herpes simplex, 
avian influenza, human influenza, mouse mammary tumour virus, rabies virus, Rous sarcoma 
virus, simian rotavirus SA11, vesicular stomatitis virus - hamster, vesicular stomatitis virus - 
human, and yeast killer. Over 55 prokaryotic signal sequences are listed, including sequences 
from Bacillus amyloliquefaciens, Bacillus cereus, Bacillus licheniformis, Bacillus subtilis, 
Bacteroides nodosus, Corynebacterium diphtheriae, Escherichia coli, Erwinia amylovora, 
Enterobacter aerogenes, Halobacterium halobium, H. morganii, Moraxella nonliquefaciens, 
Neisseria gonorrhoeae, Pseudomonas sp., Salmonella typhimurium, Serratia marcescens, 
Shigella dysenteriae, Staphylococcus aureus, and Vibrio cholerae. 

Thus, the characterization of numerous signal sequences was well known in the art at 
the time the claimed invention was made. 

B. Numerous Expression Systems Were Known in the Art at the Time the Claimed Invention Was Made 

The cited references (Exhibits 1, 2, and 3) also identify a wide variety of expression 
systems that can be used for making recombinant proteins, including eukaryotic, prokaryotic, 
and viral expression systems. 

As signal sequences other than Saccharomyces mating factor alpha-1, and expression 
systems other than yeast, were well known in the art at the time the claimed invention was 
made, Applicants' claimed invention is not limited to these specific embodiments to satisfy 
the enablement requirement of 35 U.S.C. § 112, first paragraph. Withdrawal of this ground 
for rejection is respectfully requested. 

V. Claim Rejections - 35 U.S.C. § 112, Second Paragraph 

Claim 42 was rejected by the Examiner under 35 U.S.C. § 112, second paragraph, for 
reciting "STE13 recognition". Applicants have canceled claim 42, thus obviating the 
rejection of this claim. The cancellation of claims does not constitute acquiescence in the 
propriety of any rejection set forth by the Examiner. 

Claims 24-29, 34-37, 38-47, and 52-56 were rejected by the Examiner under 35 
U.S.C. § 112, second paragraph, because it is allegedly unclear how and where the leader 
sequence is cleaved in microorganisms other than yeast. Applicants have canceled claims 
24-29, 34-37, 40, 42-47, and 52-56, thus obviating the rejection of these claims. The cancellation of 
claims does not constitute acquiescence in the propriety of any rejection set forth by the 
Examiner. 

the knowledge in the art at the time the claimed invention was made regarding signal 
sequences and expression systems. In particular, it is noted that von Heijne (Exhibit 1) 
teaches that using the cleavage prediction method described in the reference, only 5 out of 76 
processing sites were incorrectly predicted. Col. 2, page 17. Thus, use of various signal 
sequences in various expression systems does not render the invention unpredictable or 
indefinite. For at least these reasons, withdrawal of the ground for rejection is respectfully 
requested. 

Claims 38, 41, 42, 45, 48, 52, and 54 were rejected by the Examiner under 35 U.S.C. 
§ 112, second paragraph, because the Examiner asserts that the term "component" is 
confusing. Applicants have canceled claims 42, 48, 52 and 54, thus obviating the rejection of 
these claims. The cancellation of claims does not constitute acquiescence in the propriety of 
any rejection set forth by the Examiner. 

With respect to claims 38 and 41, Applicants have replaced the term "component" 
with the term "fraction". The fraction is a result of the column chromatography that is 
described in Example 12. Therefore, support for "fraction" is found in the specification in 
Example 12 and is shown in Figure 12. Additionally, the term "fraction" is a well-recognized 
term of art. 

Claims 44 and 47 were rejected by the Examiner under 35 U.S.C. § 112, second 
paragraph, because claims 44 and 47 should allegedly refer to part (d) instead of part (c) of 
claims 43 and 45, respectively. Applicants have canceled claims 44 and 47, thus obviating 
the rejection of these claims. The cancellation of claims does not constitute acquiescence in 
the propriety of any rejection set forth by the Examiner. 

Claims 24-29, 34-37, 42-47, and 52-56 were rejected by the Examiner under 35 
U.S.C. § 112, second paragraph, because it is allegedly not clear if the PTH is purified before 
or after it becomes part of the claimed composition. Applicants have canceled claims 24-29, 
34-37, 42-47, and 52-56, thus rendering the rejection of these claims moot. The cancellation 
of claims does not constitute acquiescence in the propriety of any rejection set forth by the 
Examiner. 

VI. Claim Rejections - 35 U.S.C. § 102 

Claims 24-29, 34-37, 42-47, and 52-56 were rejected by the Examiner under 35 
U.S.C. § 102 as being unpatentable over either Keutmann et al. or Kimura et al. Applicants 
have canceled claims 24-29, 34-37, 42-47, and 52-56, thus obviating the rejection of these 
claims. The cancellation of claims does not constitute acquiescence in the propriety of any 
rejection set forth by the Examiner. 

Claims 24-29, 34-37, 42-47, and 52-56 were rejected by the examiner under 35 U.S.C. 
§ 102(b) as being allegedly anticipated by either Brewer et al. (U.S. Patent No. 3,886,132), 
Kumagaye et al. (J. Chrom., 327:327), or Fairwell et al. (Biochem., 22:2691). Applicants 
have canceled claims 24-29, 34-37, 42-47, and 52-56, thus obviating the rejection of these 
claims. The cancellation of claims does not constitute acquiescence in the propriety of any 
rejection set forth by the Examiner. 

VII. Rejections Under 35 U.S.C. § 102/103 

Claims 24-29, 34-37, 42-47, and 52-56 were rejected by the Examiner under 35 
U.S.C. § 102 as being allegedly anticipated by, or under 35 U.S.C. § 103 as being allegedly 
obvious over, Brewer et al. '132. Applicants have canceled claims 24-29, 34-37, 42-47, and 
52-56, thus obviating the rejection of these claims. The cancellation of claims does not 
constitute acquiescence in the propriety of any rejection set forth by the Examiner. 

Claims 24-29, 34-37, 42-47, and 52-56 were rejected as being allegedly anticipated 
under 35 U.S.C. § 102 by, or allegedly obvious under 35 U.S.C. § 103 over, Kumagaye et al., 
Kimura et al., or Fairwell et al. Applicants have canceled claims 24-29, 34-37, 42-47, and 
52-56, thus obviating the rejection of these claims. The cancellation of claims does not 
constitute acquiescence in the propriety of any rejection set forth by the Examiner. 

VIII. Provisional Obviousness-type Double Patenting Rejection Under 35 U.S.C. § 101 

Claims 24-29, 34-37, 42-47, and 52-56 were provisionally rejected under the 
judicially created doctrine of obviousness-type double patenting over claims 31-35 of 
copending Application No. 08/340,664. Applicants have canceled claims 24-29, 34-37, 
42-47, and 52-56, thus obviating the provisional rejection of these claims. The cancellation of 
claims does not constitute acquiescence in the propriety of any rejection set forth by the 
Examiner. 






Atty. Dkt. No. 016777/0309 Kaare M. GAUTVIK et al. 

Serial No. 09/287,332 

CONCLUSION 

As the above-presented amendments and remarks address and overcome all of the 
rejections presented by the Examiner, withdrawal of the rejections and allowance of the 
claims are respectfully requested. 

If the Examiner has any questions concerning this application, he or she is requested 
to contact the undersigned. 



Date: July 15, 2002 

FOLEY & LARDNER 
Washington Harbour 
3000 K Street, N.W., Suite 500 
Washington, D.C. 20007-5109 
Telephone: (202) 672-5538 
Facsimile: (202) 672-5399 




Respectfully submitted, 

Michele M. Simkin 
Attorney for Applicant 
Registration No. 34,717 
VERSION WITH MARKINGS TO SHOW CHANGES MADE 

38. (Amended) A cell free medium [An extract] obtained following growth of a 
microorganism transformed to express a DNA sequence encoding a fusion product in which 
hPTH(1-84) is fused at its N-terminus with a leader sequence, wherein: 

(a) said leader sequence is cleavable by said microorganism upon production of 
said fusion product; 

(b) said cell free medium [extract] comprises a PTH fraction [component]; and 

(c) said PTH fraction [component] consists essentially of intact [PTH] hPTH(1-84) 
molecules. 

39. (Amended) The cell free medium [An extract] according to claim 38, wherein 
the PTH fraction [component] consists of [PTH molecules that are] intact [PTH(1-84)] 
hPTH(1-84) molecules. 

41. A method for obtaining intact hPTH(1-84), comprising 

(a) obtaining [an extract following growth of a microorganism transformed to 
express a DNA sequence encoding a fusion product in which hPTH(1-84) is fused at its 
N-terminus with a leader sequence, wherein] the cell free medium of claim 38 [: 

(i) said leader sequence is cleavable by said microorganism upon 
production of said fusion product; 

(ii) said extract comprises a PTH component; and 

(iii) said PTH component consists essentially of intact PTH molecules]; and 

(b) treating said [extract] medium to isolate [the PTH component thereof] intact 
hPTH(1-84). 
The molecular basis for the highly precise cleavage between signal 
sequences and mature exported proteins has been somewhat of an 
enigma ever since the first pre-proteins were sequenced some ten 
years ago [1]. The last residue of the signal sequence has been found 
to be either Ala, Gly, Ser, Cys, Thr or Gln, i.e. one with a small, 
uncharged side-chain, and it has moreover been found that the signal 
sequence has a very hydrophobic central core; but neither the 
positioning of this core relative to the cleavage site, nor the patterns 
of amino acids found at this site have so far allowed a precise 
prediction of the position of the point of cleavage from sequence data 
alone. In this paper, we show that the patterning of amino acids near 
the cleavage site is much richer than hitherto believed, and that, 
given this information, cleavage sites can be predicted quite 
successfully. 



RESULTS AND DISCUSSION 

Analysis of a comprehensive collection of known eukaryotic 
signal sequences, aligned with coincident cleavage sites 
(Table 1), reveals some interesting regularities. As shown in 
Table 2, whole classes of residues are absent not only from 
position -1 (the last residue of the signal sequence, cf. above) 
but also from position -3, and, conversely, the same types of 
amino acids as in position -1 are strongly preferred in position 
-3. Also, these small, neutral residues are quite rare in position 
-2 (there is only one Ala in position -2 in contrast to position 
-1 with 36, and position -3 with 25 alanines). 

Moreover, the bulky aromatic residues seem to be selected 
for in position -2, whereas the secondary-structure-disrupting 
residues Gly and Pro are found predominantly in positions -4 
and -5, respectively. In addition, there is not a single Pro 
residue in the whole region between -3 and +1. 

The hydrophobic residues seem to be merely tolerated in 
positions +1 and -2 to -4; they are conspicuously rare in 
position -5, and then become dominant from position -6 
onwards, as one moves further into the central hydrophobic core 
(see above). 

Finally, although charged residues are tolerated to some 
extent in positions -2, -4 and -5, they are much more 
abundant in the mature sequences, starting from position +1. 

A similar analysis of a smaller sample with all isozymes 
and obviously related sequences removed yields essentially 
identical results (data not shown). 



These observations make it possible to formulate a pre- 
liminary set of rules that can be used to predict the point of 
cleavage in a given pre-protein sequence. At this stage, we have 
felt it reasonable to deal with classes of amino acids rather than 
individual ones, and the following scheme can thus be refined as 
more sequences and data for individual residues become 
available. 

The prediction method proposed here has two components: 
first, the N terminus of the hydrophobic core is located by 
searching from the N terminus for the first quadruplet with at 
least three hydrophobic residues (group IV in Table 2, not 
counting the initiator Met), and defining a 'window' for 
processing between residues i + 12 and i + 20 (i being the first 
residue in the quadruplet). Then, for each residue in the 
'window', a measure of 'processing probability' is calculated by 
multiplying together the respective values from Table 3, values 
chosen 'by inspection' so as roughly to reflect the amino acid 
patterns discussed above, and fine-tuned to maximize the 
number of correct predictions. Finally, the site of processing is 
predicted as the site with the highest 'processing probability'. 
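
The two-step scheme described above (locate the hydrophobic core, then score each residue in the window by a product of per-position values) can be illustrated with a minimal Python sketch. Everything numeric in it is an assumption made for illustration: the weight table is hypothetical and is not the table referred to in the text, the hydrophobic set is a stand-in for "group IV", the window bounds of i + 12 to i + 20 are taken as read from the paragraph above, and treating each window residue as a candidate for position -1 is one possible reading of the method.

```python
# ASSUMPTION: stand-in for the hydrophobic class ("group IV"); the real table is not shown.
HYDROPHOBIC = frozenset("FILMVW")
# Residues the text reports as the last residue of the signal sequence.
SMALL_NEUTRAL = frozenset("AGSCTQ")

# HYPOTHETICAL weights keyed by position relative to the cleavage site.
# They only mimic the qualitative patterns in the text, not the published values.
WEIGHTS = {
    -3: {"small": 1.0, "hydrophobic": 0.3, "other": 0.1, "pro": 0.0},
    -2: {"small": 0.3, "hydrophobic": 1.0, "other": 1.0, "pro": 0.0},
    -1: {"small": 1.0, "hydrophobic": 0.05, "other": 0.05, "pro": 0.0},
    +1: {"small": 1.0, "hydrophobic": 1.0, "other": 1.0, "pro": 0.0},
}


def residue_class(aa):
    """Map a one-letter amino acid code to a coarse class."""
    if aa == "P":
        return "pro"
    if aa in SMALL_NEUTRAL:
        return "small"
    if aa in HYDROPHOBIC:
        return "hydrophobic"
    return "other"


def find_core_start(seq):
    """0-based index of the first quadruplet with at least three hydrophobic
    residues, skipping the initiator Met; None if there is no such quadruplet."""
    start = 1 if seq[:1] == "M" else 0
    for i in range(start, len(seq) - 3):
        if sum(aa in HYDROPHOBIC for aa in seq[i:i + 4]) >= 3:
            return i
    return None


def site_score(seq, c):
    """Product of the weights when residue c (0-based) is taken as position -1."""
    score = 1.0
    for pos, table in WEIGHTS.items():
        idx = c + pos + 1 if pos < 0 else c + pos  # there is no position 0
        score *= table[residue_class(seq[idx])]
    return score


def predict_cleavage(seq):
    """Predicted signal-sequence length (cleavage follows this many residues)."""
    i = find_core_start(seq)
    if i is None:
        return None
    # Window i+12 .. i+20 inclusive; position +1 must still lie inside the sequence.
    candidates = [c for c in range(i + 12, i + 21) if c + 1 < len(seq)]
    if not candidates:
        return None
    best = max(candidates, key=lambda c: site_score(seq, c))
    return best + 1


# Toy sequence built so that an Ala-X-Ala motif sits inside the window.
toy = "MKK" + "L" * 10 + "DDADA" + "E" * 7
print(predict_cleavage(toy))  # 18
```

With the real class definitions and weight table substituted for the placeholders, the same structure would apply; ties between equally scored sites, which the text counts as undecided predictions, are not handled separately in this sketch.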

Applying this scheme to the sequences listed in Table 1, only 
5 out of 76 processing sites are incorrectly predicted (nos 15, 21, 
37, 41, 45), with one additional case where the method cannot 
differentiate between the correct and one incorrect site (no. 51). 

Using less than the full set of rules, it was found that the rule 
'no Pro in position +1' can be discarded without affecting the 
number of correct and undecided predictions for the present 
data base. All other rules are important, though. As an 
illustration, using only the rule for position -1 yields 16 
incorrect and 39 undecided predictions, and adding the rule for 
position -3 only improves these figures to 54 incorrect and 15 
undecided predictions. 

The amino acid patterns presented here can perhaps also 
yield some insight into the actual structure of the signal- 
sequence-protease complex. As shown above, positions -1 
and -3 seem to be strongly selected for small, neutral residues, 
whereas positions +1, -2, and -4 seem to accommodate 
almost any kind of residue. Moreover, there is not a single Pro 
from position -3 to +1, and, further, this region is often 
separated from the hydrophobic core of the signal sequence by 
the strong helix-breakers Pro or Gly [2]. 

Thus, a possible signal-sequence - protease structure would 
be an alternating sheet structure near the cleavage site, joined at 
its N terminus to a largely hydrophobic helix (Fig. 1). Possibly, 
position +1 is located close to the membrane surface, as 
indicated by the observed greater proportion of charged 
residues in this position (Table 2). 
A structure of this kind would need some 8 or 9 helical 
residues added to the 4 or 5 in the extended sheet conformation 
to reach through the non-polar interior of the membrane, a 
figure that compares quite well with the length of the uncharged 
segment of the shortest known signal sequence: 13 residues. 

A similar proposal has been made earlier on the basis of an 
analysis of predicted secondary structures for signal sequences 
[4], and the model is also in agreement with recent results from 
an energy-minimization study of the conformation of one 
particular signal sequence [5]. 

A signal sequence would thus contain two different and 
largely independent 'signals': one in the form of a hydrophobic 
core, presumably responsible for initiating export and for 
binding to the 'signal recognition protein' [6], and a second one 
in the region -5 to -1 conferring processing specificity. 

This analysis has been based on a collection of eukaryotic 
signal sequences and it might be asked how well prokaryotic 
ones conform to the patterns presented here. Unfortunately, 
only some 16 prokaryotic signal sequences are known so that 
too great a reliance on a statistical analysis would hardly be 
justified at this point. From the limited data available it seems, 
however, that the only obvious differences are limited to a more 
pronounced exclusion of charged residues from the prokaryotic 
sequences (except, of course, at the N terminus), and a strong 
preference for small, neutral residues in position -6 (data not 
shown). 
[Fig. 1. Caption largely illegible in the source copy; it concerns the cleavage between positions -1 and +1.] 



More generally, prokaryotic signal sequences are richer in 
Ala and poorer in Leu than eukaryotic ones [7]. 

In conclusion, it seems that a statistical approach to the 
problem of signal sequence processing can yield valuable 
information not readily obtainable from experimental work, 
and may, with suitable refinements, provide a good basis for 
schemes predicting the site at which processing of a given pre- 
protein sequence will take place. 







I. Strouwi. A.W. & B<jiiBC I. 0SIJ2) CKC Crft. itev. .ff£ui*««- 

20S-235, 

i voft HflSftC G, Cl!^*3l} A/fl/. 5«p/. J55» 537 -541. 

s! ipincaii M R * XUiuwCf; R.D. (1982) ASw* .««za: ^cl hc>A.7y, 

341^-3417. ••""y^^ 

6, Wftlter,t»., Ihrihiini. I fttoH G. (19W)/ CW/ AVi/, f/. 545-550. 

1 voo Heiipe* O. (I9tt> Etft, /. jWof-fcp*t. IW, 419 -422. 

S. Dand*ir.A.M.»llobitiwn,E.A.,Appcito.ll.&Qua»ba,P.K.(l98^ 

/Voci«(iffi«ttrf.^<iaSi,7P,39&7"JWi. • -'^^ 

9. Ric«.O.A.4TayKJ.M.(l9«0Xiiaf.CJtoB.25d,llI99-lim 

10 rHin, W. W., Knmenbcfft H- M., P. C. Mftloof. F. & Hnlj«ff. 
J. K (1981) Pron. Nal Arvl Sd, UV^. 7B, 5329 • S3 J3. 

I I. Oun. S. J. Ewufin. S., Kwok, B. C M,, Kttiaef , i. M., Falkiacf, S. i 

SteincT, D.F- (1981) / flW, C»wm. ?56, 7Si;5-7<01, 
12. Hohttri,P. M., Shcfl, L^P.,Cw«for4 R-^PlCWt, R. U A Rmtcr, W.J. 

(mO) (l^<ah. DC? 2/0, 1 JSO- S5fi3. 

a Bom G.I, W.F., Pietet, CwdeH. B., GtH^dmne. ft *. 

Ruttcr, W. i. (1S79) tow fAorJrf.; 282, 525-527. 
U Umedicct Roamihal. N., EfttT««db, A.. Gilbert* W., Koloilwf, 

lL4TWd,R.(1979)Cfl//,M,54*-3SlL . 
15 Mereictp J.-C A, Gayc P. (J9<0) Aiuu S',Y, Ai^. Sd 232-- 251. 
16. LinBWa.V.R..>C2aP.N.Li)dt8h.IiF.AI3tobe>,G.(rt^^ 

Ctm. 25 J. S667-IJ670. ^.^ .^^ . 

17 Chin, U Bttdlcy. W,A. A McMim A. JC (1980) / BM. O^, w, vf/ji-^i 

looco-ioofij. ^"S^m 

IB. MoUv»C.VIlai,U.&Kxcil,a{im)iVi,c./^Mc<|d;S^ 

2260 -2263. liA^tS^ 

19. MclCflaOr DJ. A Maurer, R. A. (1978) Bhcititmtry. /7, 5115- 5219. ,j.":r;;--^ 

20. Sherwood, LM.»Buntdn,Y. A SdiBAtef J. (1979k ?rw^^^^^ 

SdL t/aw, ?6, 38t9-3ll». ^ , S 

21. Pirkisiu S.. Felhcrstufl, J.* CanficW, JK. & J^oiine, l. (1581) J. fiU. ^ r. 

C/(«ffl. 25A; iai6-lS23. iH\ 

H. 4'lddo,J.C.&Goodiwa,H.M.(1979)No(u;c('W;2*/,J51-3S&. 
23. Mflldcy, hLUMlock, P.W^ Wilhrd, J.^ft W«rd, D.N. (1979;// 

tf(0/. Oert. 2S4, 1580-3W5, if^i^:^! 
2d. Swbiff, P. a, Hhinc J.. MartinJi J. A., Biwtcr, j.D. Gm>dtnan.;„v:;;' 

H. M. (1977) J\to«rtf f to*/.; 270, 4M-494v lli'i 

25. Martial, J.A.» Hiilhw«U, B-A., BaaUtr, J.U A Goodman, H. M. - 

(1979) 5cjlwr! (Wa'th. nC) 205, ; , ■■; 

26. Miller. W. L, Miotiiii. J. A, A BhiIct, I. D. (1 9I») /. Blot. ChtnL 2;J. :.. 

7521-7S24. . 

27. Kionmbtt^, H.M., M(^)pvitl, R,R., MAP'-^uh. J.A, Natli»Mj...;?:ii^J;S 

Sbaip, P. A.. ^>IU» P. A. & Rid), A. (1979) Prof. Naii Acii^, Arf- A; ig;; ^; 

2S. HudM^P,HaJey.^ClronV.M.,Sh^llc,J ^KllJl. H.(I981) Afaftrf* 

r£i»rtd:; 2PM27-13(. . 'ii^iiii^^i^i 

29. T.D., YM& M. A Beantf, J. (19«1) mc -A^fl*/ i^^wf- So. J>.;U-^^' 

w:4, ^,243-246. 

30. DugoKiyk, A.. Law, S. W. & UeaaUoa. 0. H (I9B2) iVw. .Vfftf ^ t^,ilV^.|,,„ 

M. SlnuB^ A.W., Bennelt. CD.. I>6aohw, A.M., R<>dkey, J.A. «|;?:?vr. 

Albcm, A. W, am J' Btol Chsm. 2J2, tm^^S^, - 
32. IC4ff,S.R-AFo»tftr,J.A.(J98l)/£fa^t::^it2Jtf,S946-5949. 
a, Piilnte,R.D..T!ubode»u,S.R, Rogers, G.&B6jnic, l(19M).4w<.C^i^^M 

JV.r /(coi iSWL 3«, 192-209 I • ^ 

34. Uicht,M., Long, a U Chandra. T, Kunichi. K.. Kidd. V J., Midi. 

M., D»vic, E. W. & WoD» S.t. (1982) .Vo/n^i (Lend,) fii5 - r, 
659. ' ^.W^- 

35. Paiktf . M., Neetftom. M. & While. R. (1982) r^ft<t; 2«?> jg^|i;:; 

92—94. "i>jii^/-7:i.^ 

36. Pcwsoa. H., Jof »tv»n. H. & Zabictskt> J. (198^) Pr^c. Sail Actitl. Sd ^jj^. 

t/W,77,*i349-<i353. ^'^^fT 

37. SKrfjto, W., RlobeL G, * Waller, P. fl9Bl) Eur. J, Biochem, ^^i&v'r 

519-527. W'' 






r'v'iS- XjiiitoTlis A.. Wunwr, W.H.. & Oirlis, P.J. t^^ntre (iMti) 5fl. MaLuiUhL S.. I 

Vfin Jou, W.. Vcflwcycn, M.. Ocvaj. ft., Saman, R, Fao£» R.. 
;Q liuylrfworck, f toa. W.. Hm^. G.. Barter. a» Cm^. N. <fr 

^'•••I'-^Sft- GoodH I>'V, Ijwna, D. W., Ml, T,J, Gros^ M, Uwd. R.M., 

., p. V. ^^nw fXirtirf.; iw, 2D»2d. 

:;S feS: iaivPu R-, Adclmaa, J., Oall. T. J-» Ok^m, M., docddcl, D. <b UIWA, 

A SWteBOfl (WojIu DC) 212. tl59-ll62, 
'Sng Oi« P. W. AOocddcL P.V. 09»2) Hmwe fUnd.^ 298. 859-F63. 

^' « wciramuu, C, YiSiu) iVowc; (Xu«i'./ ioJ, i^T/ - ivv. 
i^'.'Wltclrt tf. i.> Wdf, ()., Kimtor. r.,&jhRAl«r, B. * ainstein, Y. (1 WO) 
111^- ^Bn. ^LK Six ^ 2IR»-?3t. „ , ^ ^ , , 

S^ISSOtvci, tJ.,Zikul, ft.. BBhjn, IC. ^BtH^vU O., Run, D. A Cohen. J.B. 

Ci«iJfli. KD* (1982) St/CTCc fH-VjA. DC) 309 310. 



231-251. 
a. Chan. 25^, 

S2l5-Sai9, 
981) 7. WoJ; 
.,N.0979)-r. 

: jmiuu II, M. 
:i.Cftan,2SS, 

Nithini, 

^ 1// Aca^ Sci, 

J.A. & 
.VJ..Macr. 
nlAcad^Sd, I 



: f ; 'i! C w p.. Hu*. R4ym(«Kl> Hajje, a * Menicr, J-C ( 
i^^^k QiHaHJ, a, Quirogiw ML, Swain, W. F,. NifcovUi. W.G. &tiutt!tla$. 
^wd^fctWM. 79; 31-35. 

G. von Heijne, Forskningsgruppen för Teoretisk Biofysik, Institutionen för Teoretisk Fysik, 
Kungliga Tekniska Högskolan, S-100 44 Stockholm 



bovt. Ah KitB,T., NftlOTUw. M., Chang, A.CY., 

, . & Nutna, S. (t"J79) Mttiw* {//uuLi 27&, 42.V4a7. 

j1 . KaidiS H.. Fttfuianl. Y.. 'la)mha$h[, Node. M, Mwttnoiu, Y., 
KifOHc, Aiai, M., iDey&mn, S., Nafcanishi, 5. & Niniia. S. (t W2) 
iVa/uf* /W; MS - 249. 

52. CfMiib, M.» Sed)»«. P. H., Adchmitl /. , Brten. I.. & Hcrbort, E. 
^amre (LaidJ 29f, 663 - 666. 

53, Jocn Y 00, 0.. PowbII, T. & Ajforwal, C L (1 9jf2) >Yce; A^of/ Sci 
79, t049-lCl53. 

<i4. ptmthier. 7.-J., Footc S*, CbKmMftud, fi.. Stmbtirg, A. D., Corvot, P. 

ft RoPBCtm, F. (1982) ATotenr a^rfj 29S, 90-91 
55. Ria;Ftd»1. A-C. Ch^ K,K. A DAodauii, J.R (1962) A&i/wt 

36. Magaxia, M„ Minih, C. D.. i'un<i«, C I^. Deschcnca, R.. Taviwilni. 
M. A. & Dixon, j: K (19((2) /V*wr. ^tfrf --^fOi/, ^ct. f/S^, 7P. 5152- 
5f.56. 

37. OQQdmHn, R. IL. Jacotei^ J. W., Chin, W.. Uind. t. K., Dee, P. C. & 
Halwncr. J- (1980) rm. Kml Aead, Sd. USA. ^, 5669 -5lt73. 

38. Hobart. K Ciawfonl. R., Slwn. R. & Rullcr, W.J. i,mO) 
.Wfliw/ flW.; 137" 141. 

59. JicoH J. W„ GoDdman. R.H., Chia, W. W.» Doe, P.C-, Hotw^r. 
J,F.» Bdl. N.H. & Potts, J,T, (1981) Scimee (Wash, DC) 213. 
457-4*9, 

60. \jffii\ P. K. .Goodmiin, R. H., OcPp P. C ft Hfthcncr, J, F. (19S2) iVw. 
/^o/; Acaif, Sol l/SAr 79, 345-349, 






Signal Sequences 
The Limits of Variation 






Gunnar von Heijne 

Research Group for Theoretical Biophysics 
Department of Theoretical Physics, Royal Institute of Technology 
S-100 44 Stockholm, Sweden 



(Received 29 October 1984, and in revised form 26 February 1985) 

Variations in length and composition of the charged N-terminal, central hydrophobic and 
polar C-terminal regions in a large sample of signal sequences have been mapped, both as a 
function of the overall length of the sequence, and in an absolute sense, i.e. various 
"extremes" have been sought. The results show subtle differences between eukaryotic and 
prokaryotic sequences, but the general impression of signal sequences being highly 
variable is reinforced. Criteria for a "minimal" signal sequence are suggested and discussed. 



1. Introduction 

In the process of protein export a central role is 
played by the signal sequence: an N-terminal 
segment that somehow initiates export whereupon 
it is cleaved from the mature protein. All signal 
sequences seem to be built along the same general 
lines, but the fine-structure of the design has only 
recently become a subject of study (von Heijne, 
1983, 1984a; Perlman & Halvorson, 1983). Three 
structurally dissimilar regions have been recognized 
so far: a positively charged N-terminal region, a 
central hydrophobic region and a more polar 
C-terminal region that seems to define the cleavage 
site. These regions are present in all signal 
sequences but the limits imposed upon them by the 
export machinery have not been systematically 
studied; in particular, it has not been ascertained 
whether they are all equally prone to variations in 
length and amino acid composition. This is an 
important question, since one of the outstanding 
features of the signal sequences taken as a group is 
their extraordinary variability in terms of overall 
length and amino acid sequence. 

In this paper, eukaryotic and prokaryotic signal 
sequences are grouped according to their lengths, 
and the variations with length of the three 
structural regions (termed the n, h, and c-regions in 
what follows) are analysed. The analysis shows 
subtle differences between eukaryotic and pro- 
karyotic sequences, and suggests "minimal" 
requirements that a fully functional signal sequence 
must conform to. Available data on non-functional 
mutant sequences, as well as on export-competent 
revertants, are discussed in the light of these 
requirements. Finally, the functional significance of 
the results are assessed and related to current 
models of protein export. 

2. Methods

The sample under study consists of 118 eukaryotic and
[?] prokaryotic signal sequences, all with known cleavage
sites. In the prokaryotic sample, no sequences known to
be cleaved by the "lipoprotein signal peptidase" (Innis et
al., 1984) have been included. Unless otherwise indicated,
references can be found in von Heijne (1984b).

Prokaryotic sequences: Escherichia coli maltose binding
protein; plasmid pBR322 β-lactamase; phage M13 major
and minor coat proteins; E. coli λ-receptor; Salmonella
typhimurium histidine binding protein; S. typhimurium
lysine-arginine-ornithine binding protein; E. coli leucine
binding protein; E. coli leucine-isoleucine-valine binding
protein; E. coli arabinose binding protein; E. coli
galactose binding protein; E. coli chromosomal
β-lactamase; E. coli ompA protein; E. coli ompF protein;
E. coli ompC protein; E. coli heat-labile enterotoxin A and
B subunits; Bacillus subtilis α-amylase; E. coli alkaline
phosphatase; E. coli phoS protein; Staphylococcus aureus
protein A; Corynebacterium diphtheriae toxin tox228;
E. coli papA; phage Ike gene VIII and gene III proteins;
Vibrio cholerae toxins ctxA and ctxB (Mekalanos et al.);
E. coli pilin K88; E. coli tolC (Hackett et al., 1983);
E. coli D-ribose binding protein (Groarke et al.);
Pseudomonas aeruginosa exotoxin A (Gray et al.);
Pseudomonas sp. carboxypeptidase G2 (Minton et al.).

Eukjuyutitt Mquenees: raf. whey p(ioiiphn|)mu»in: 
huffliui scrum albumin: rat ai^aoid giyooprtiteln; mouse 
th^rrntit^pin x-sitibueit: atJMitie; htf^fisb^ «fi9;]Brfifib ^nd 
humiMi influllUft; ovlno p And K-ouclnB; ovine a and 
/^loetaJbamins; rabbit ot'lMtAlbuniin: iduudd immunu- 
gliibub'j} H^chain (H-815); - hybL'i<li>ma inimuneglobullu 
H'Dhiun (93g7}: tiibbit iinmunoglobulfn K-chain (PfiS): 
mouse K-lAkinimogtobulin L-oheins (Ir4Xb, Ii*3l5 and 

© 1985 Academic Press Inc. (London) Ltd.
Figure 1. Mean hydrophobicity as a function of the
position from the cleavage site (between -1 and +1) for
the full eukaryotic (×) and prokaryotic (○) samples. The
hydrophobicity scale used is from von Heijne (1981).
Negative values are more hydrophobic.



L-?2J); motuo embr^'onic V„-iii«nnnngk)bnli*i {i'OH 
jQim^o^lobnlin V)|-107; calviaiL immviiojcbbuHn Vn- 
tsmkml VLJ>i^U; hctt melHtm; rat hctdn; bumaa 
pliLOMital tactogfxn; hum&n dioricjlfomdfitirt^pin i^-Hubunit; 
TMbbSt utefogf.fi>Hii>r r4t gn»rtb homonue; b^tvinR fmm- 
thyiuid hnriTKinw: HumAi^ kuki>eyt«. tmrrtunv, and 
fibrablttftt lnt«iferoxui; rat f^xin; chicks tropoejAAtm 
chkkHri av^mucoid,' cbidccii .1>'»oxynie; rJiicIcmi 
ftonalhwmm; human «i-a«litryp(ttn; rat proftlAtio bindmg 
protdna Cl^ CZ and C3; mouae Ib^er amylm; mouAa MHC 
aiiiifi^tn H^Sl^; inoueM MliC I'A tight chain; Tnoivto MHC 
Xtr^t D ^aUi; m«iiiiin ch&Jn; bvirian KI«A.-DR 9 

unrl PR /fi-eh&ina; human HLA DC*o( «Ad 1)0-^ «KaUi9: 
rat caTboxyp«Y|i^i|fiw A; Torptdo mlffiimiea tJcel^yiaMim 
receptor 3. f, y and ^Auhuiuitt: hiuoiAn muBcIt* uiwtyl- 
cholinc ii»ticpt-gn molfie cetn protom 223.1; mouae 03 
onniplwiiient: m pon^Mtii' HNA««i&: tuftHKr. tfipUioniittU^ 
cortin: rat someitfivtattn; homoxi Aiithimmbm Til; mt 
ThyO; bovino Arp-Xpll hormone; ycont mwUwif; 
Tnoi;i)«>thyt6tmpin /f-tiqbiiiiit; hAtmiUir glUCfl^; pfta HMd 
kctin; rat apolipoproteiii F,t barley ff ani;yiaae; human 
up(>lt]^i}irotx)1n Al: iDouKe/f-oryBtiiiiniTacanKiotcnfiiTingen'. 
tr^'|itnfiogen: roc elaAfcaftct T innA II; bovino cnymdnin; 
kgumin; bUitton insulin-like growlb racti»r I; rat aetdhiAl 
vkMh twcretlon IV protoin; ra-bbit poly-lg reooptor; V'8 
vtnu Rlyef»})rntmu G: i^dtmovEruH glyfiapratoin: robUn 
viraft (^ypO]^otdn (ERA); human innucnicii A/Vintnrky 
And A/Jap/ ha«ma^tttinin>i avion iniiiien%» A/FFV/ 
haeina^lutiniii; Htrpti ttimpUx vitnA t^ye-.-} gtycuprotein 
Oi humim porathjTDid htinapiui (K^dy et Ittl, 
yiiant ph05 (Atima «l o/^j 1983); Httrffut nmpiex typed 
glyv^protein U tffn« (Wat«ort» inS3); mt A-ltctalbumiJi 
(Qottba & Bofaxaf 1984]; brrrlne Ot-Npl honiiiKiti 



(RttppBTt a af., liUH); humo]) epidermal growth fiiotor 
receptor (Ulkich a/., 1I)B4); humAn OfariRi'niU^ fucU»r 
(An«on of., l^); human pancmLio |>uK-)ie]itido (Bo^l 
a fli., I'JBil; wboftt gHiidfci (RafAlaki et nL, T ccll 
iiBcientor a-sttbtinit [Soito d M., humoii transferrin 

}^^)\ TKawnatoGoccM ttmi^U thouixiatui ,n (BfAsnA ri at., 

Ni^&fifttg> I084-); human a^iol iiatriurftja Taut^t 
(NBhAy^m* e< <it, 1984]; hunwui HTLV-I *nd HTLV-ii 
env^pe ^ymfwtttmj» (^odm^ ^ D^, i!>S4); himAn 
inHuUn-bTwa growth fai'tw 11 (DuU a/„ WB4); mwrinf 
ftpldfirtnii^ growth &elor binding protolzi (Lundj;rari tit 
lft$4); c«ir acetytoholimt rtuxtpTAr ?-aubunit iTeJcal «t ok. 

htiikiAn ar-Ubriiiu^ {Kant- ti al,r 10$$); human j 
and v-f.brua(^n (Qhun^ tl ul., human fl:-Hapt6- 

giobJn (Yongei al, ti^^); hiimou rtttnc) bliadivg pruteitn 
(Colanti)onl, tftSft); rat cbjTuotryptui (MiK:7>m*«AtI <i 
Idd^); Z>roMfU2a ^rtetanicdsitr glu« protein K (OAi^nlcel, 
1983); VS viratf {X.J, Ofideu) fi3y(*«j»r<tt«M (Callione, 
1083]; Aply^d^ neurdpeptids (Sch«l]cr a).. 1984]. 



3. Results 

(a) The length distribution is different for
eukaryotic and prokaryotic signal sequences

The number of sequences in the various length-
classes is given in Table 1. Except for an extremely
short sequence with length L = 13, the eukaryotic
distribution starts at L = 15 (5 sequences) and the
prokaryotic at L = 18 (3 sequences). Moreover, the
main weight of the distributions falls between
L = 18 and L = 20 for the eukaryotes (37%) and
between L = 21 and L = 23 for the prokaryotes
(5[?]%). Thus, towards their lower ends the two
distributions differ consistently by three residues.

(b) The mean position of the boundary between the
h- and c-regions is different in eukaryotes and
prokaryotes and does not vary with length

It has been noted that the overall amino acid
composition of the c-region is more polar than that
of the h-region (von Heijne, 1983). Indeed, in a plot
of the mean hydrophobicity of each position in a
large enough sample of signal sequences aligned
from their cleavage sites, the h/c boundary stands
out clearly (Fig. 1). There is an obvious difference
between eukaryotes and prokaryotes, however: in
eukaryotes, the mean position of the h/c boundary
is between residues -6 and -5, whereas in
prokaryotes it is between residues -7 and -6. This
is true also for the individual length-classes defined



Table 1

Number of signal sequences of given length in the eukaryotic and prokaryotic samples



Length 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 >30



EiLkitfyolftH I 0 (5 a 4) iS \1> 12 0 (S A1 U 10 (0 • 1 0 a 9) 2 
Z^Iuryoitoi 0 0 0 0 0 (9 fi A 7) (a 6) {0 4 I 0 0 U C 2 



The pooled length-classes referred to in the text are denoted by parentheses when they comprise
sequences of more than a single length.
The h/c boundary (as indicated in Fig. 1) is between
-6 and -5 in the eukaryotes, and between -7 and -6 in the
prokaryotes.



in Table 1 (data not shown); thus, the position of
the boundary does not vary with overall length.
Apart from this difference, the two curves follow
each other closely even in the region downstream of
the cleavage site, indicating that not only the signal
sequences proper but also parts of the mature
sequences are under similar selective pressures
(cf. von Heijne, 1984a).
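
The mean hydrophobicity per position, for sequences aligned from their cleavage sites as plotted in Figure 1, can be sketched as follows. This is only an illustration under stated assumptions: the Kyte-Doolittle hydropathy scale is substituted for the von Heijne (1981) scale (so here larger values are more hydrophobic, the opposite sign convention), and the function name and the two toy sequences are hypothetical, not data from the paper.

```python
from collections import defaultdict

# Kyte-Doolittle hydropathy values (larger = more hydrophobic).
# Stand-in scale: the paper uses the von Heijne (1981) scale instead.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def mean_hydrophobicity_by_position(signals, scale=KYTE_DOOLITTLE):
    """Average the scale value at each position counted back from the cleavage site.

    Each entry in `signals` is a signal sequence whose last residue is
    position -1 (the residue just before the cleavage site).
    Returns {position: mean value} for positions ..., -2, -1.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for seq in signals:
        # reversed() walks from position -1 towards the N terminus
        for offset, residue in enumerate(reversed(seq), start=1):
            totals[-offset] += scale[residue]
            counts[-offset] += 1
    return {pos: totals[pos] / counts[pos] for pos in sorted(totals)}


# Two made-up sequences of different length, aligned at the cleavage site.
profile = mean_hydrophobicity_by_position(["MKLLLA", "KLVA"])
```

Aligning from the C-terminal end of the signal, rather than from the initiator Met, is what lets sequences of different length contribute to the same position.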



Table 3

Number of eukaryotic sequences with a given number
of Leu, Phe and Gly residues in the region -13 to
-6, and the expected numbers calculated on the
assumption of a binomial distribution

WcifKwioM a I -4 4 i-v e 7 8 



No. of wquenCM 
EvpecCed (LeU) 
tf»pH!t«d (Phnl 



) $ a9 7 

li It id 33 27 14 

61 :» 1& 3 0 0 

£Q 43 13 ;A 0 0 

98 Id 0 1 0 0 

{Ik's St 2 II 0 Ij 



3 I 0 

A 1 0 

0 « 0 

n ft 0 

Q 0 0 



(c) The h-region is enriched in hydrophobic
residues but shows no sequence regularities

The hydrophobic residues Phe, Ile, Leu, Met, Val
and Trp are enriched in the h-regions of both
eukaryotic and prokaryotic sequences, and drop
sharply in frequency at the h/c boundary.
Conversely, the charged and polar amino acids
(Asp, Glu, Arg, Lys, His, Gly, Pro, Gln, Asn, Ser,
Thr and Tyr) are virtually absent in the h-region
but dominate the c-region. Ala, which is very
abundant in the prokaryotic signal sequences, does
not vary appreciably in incidence across the h/c
boundary in either sample (Table 2).

It has been claimed that the distribution of
amino acids in the h-region is non-random (Inouye
& Halegoua, 1980; Perlman & Halvorson, 1983). In
the present sample, however, no convincing
patterns of fine-structure are apparent in this
region; indeed, the number of eukaryotic sequences
with a given number of a particular amino acid in
the region -13 to -6 closely follows a random
expectation, i.e. a binomial distribution (Table 3).

It has also been claimed that some nearest-
neighbour pairs of amino acids in the h-region are
present in numbers that cannot be explained on the
basis of random pairing (Perlman & Halvorson,
1983). This possibility was tested in the present
sample by comparing the observed number of pairs
of given amino acids in the region -13 to -6
(eukaryotes) or -15 to -7 (prokaryotes) with the
numbers obtained for a sample with randomized
h-regions (i.e. the amino acids in each individual
h-region in the original sample were randomly
"scrambled" before the pair-count was performed).
As judged by χ²-analysis (one degree of freedom),
no significant deviations (P < 0.05) from the
expected counts were found, either for nearest-
neighbours or for residues separated by up to three
residues, except for Leu-Ile (i, i+3)-pairs
in the eukaryotic sample (data not shown). This
indicates that there are no strong sequence
constraints in the h-region beyond the observed
enrichment in hydrophobic residues.
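
The binomial expectation referred to for Table 3 can be sketched as follows. The idea is that if each of the eight positions in the region -13 to -6 holds a given residue independently with probability p, the expected number of sequences with k copies follows a binomial distribution. The function names and the four example windows are hypothetical and are not data from the paper; p is simply estimated from the windows themselves.

```python
from math import comb


def expected_counts(n_sequences, n_positions, p):
    """Expected number of sequences with k = 0..n_positions copies of a residue,
    if each position holds that residue independently with probability p."""
    return [
        n_sequences * comb(n_positions, k) * p**k * (1 - p) ** (n_positions - k)
        for k in range(n_positions + 1)
    ]


def residue_count_histogram(windows, residue):
    """Observed number of windows that contain k copies of `residue`."""
    n_positions = len(windows[0])
    hist = [0] * (n_positions + 1)
    for window in windows:
        hist[window.count(residue)] += 1
    return hist


# Hypothetical 8-residue windows standing in for positions -13 to -6.
windows = ["LLALLVLL", "AVLLGLLA", "LIVLLAFL", "FLLLLLLA"]
observed = residue_count_histogram(windows, "L")
p_leu = sum(w.count("L") for w in windows) / (len(windows) * 8)
expected = expected_counts(len(windows), 8, p_leu)
```

Comparing `observed` with `expected` class by class is the kind of check the table summarises; with a real sample one would pool sparse classes before applying any significance test.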



(d) The n-region accounts for one half of the
length variation, but the net N-terminal charge
does not vary with length

The net charge distribution in the n-region differs
by one positive charge between eukaryotes and
prokaryotes, indicating that the N-terminal amino
group in eukaryotes provides one positive charge,
whereas the blocked Met in prokaryotes does not
(von Heijne, 1984b). As is clear from Figure 2(a),
the net N-terminal charge does not vary
appreciably with the overall length in either
eukaryotes or prokaryotes, and has a mean value of
about +1.7 in both groups. The length of the polar
n-region varies strongly with the overall length,
however (Fig. 2(b)); the variation is similar in
eukaryotes and prokaryotes, and accounts for
approximately one half of the total length
variation.

(e) The h-region accounts for one half of
the variation in overall length, but
there are no regular variations in
amino acid composition with length

Since the length of the c-region is independent of
the total length, the remaining half of the length
variation stems from the h-region. As is shown in
Table 4, there are no regular variations in amino
acid frequencies between the different length-
classes, and the only suggestive observation so far is
G. von Heijne



Figure 2. (a) Net charge and (b) length of the n-region
as a function of the overall length (horizontal axis: total
length) for the prokaryotic (○) and eukaryotic length-classes
defined in Table 1. The initiator Met is assumed to provide
one positive charge in the eukaryotes but not in the
prokaryotes (cf. the text). The n/h boundary is defined by
either (1) the last charged (Asp, Glu, Arg, Lys) or large
polar (Asn, Gln, His) residue, or (2) the last pair of small
polar (Ser, Thr, Gly, Pro) residues (whichever yields the
longest n-region) on the N-terminal side of the
uninterrupted non-polar stretch.


that the shortest sequences (L = 15 in the
eukaryotes, and L = 18 in the prokaryotes) have
the most hydrophobic h-regions (per residue) both
in the eukaryotes and the prokaryotes (the mean
hydrophobicity per residue in the region -13 to
-6 in the eukaryotic L = 15 sequences is -8.9 kJ/
mol, the next-largest value is -8.3 kJ/mol (for the
L = 30 sequences); the value for the region -15 to
-7 in the prokaryotic L = 18 sequences is -9.6 kJ/
mol, the next-largest value is -9.2 kJ/mol).
A selection of sequences (extracted from our full
collection of some 300 entries) that are "extreme"
in one way or another is on display in Figure 3. The



Figure 3. A collection of "extreme" signal sequences
(see the text). The h-region is in boldface. Line a, mouse
β-crystallin; b, V. cholerae toxin (ctxA); c, human
HLA-DC β-chain; d, S. aureus protein A; e, human
influenza A/Jap/ haemagglutinin; f, E. coli heat-labile
enterotoxin, A-subunit; g, human β fibrinogen; h, phage
Ike gene VIII-protein; i, chicken α2(I) collagen (Tate et
al.); j, mouse MHC [?]-chain; k, human pancreatic
polypeptide; l, C. diphtheriae toxin tox; m, human
choriogonadotropin, β-subunit; n, hamster glucagon; o,
E. coli ompA protein; p, ovine casein (Mercier & Gaye).



first four entries show extremes in overall length,
with a more than twofold increase from the shortest
(L = 13) to the longest (L = 36). The variation in
the n-region is even more impressive: from one
residue (line e) to 17 (line g). The shortest
eukaryotic h-region found so far is only seven
residues long (line i); the longest is some 16 residues
(line k). For the prokaryotes, the corresponding
values are nine and 15 (lines h and l). In terms of
amino acid composition, there are h-regions that
are almost 100% Leu (line m), 0% Leu (line n), rich
in Phe (line h), and rich in Ala (line o). The
sequence in line d, finally, has an unusually long
c-region (10 residues).

(g) Point mutations, deletions and revertants

Counting from the cleavage site, almost all
export-deficient point mutations described so far


Table 4

... classes (cf. Table 1)



[!<ru 
Vai 

PllB 

Alt 

fksr 



0'57 

o-oa 

0-07 



la 

O-IS 
0-00 
(MO 

o-a; 



10 

0-85 

0.09 
0^ 



to 

OOfi 



Zl 

0*43 
0*10 

045 



0-11 • 
0-06 



i(4 

o-aa 

0-C7 

0-n 

O07 



Q't3 

0^ 



O'Od 
iHO 



The h-region is defined as the segment between position -6 and the mean position of the n/h
boundary as read from Fig. 2(b).
Figure 4. Signal sequence point mutations and
deletions (Silhavy et al., 1983). The h-region is in
boldface.


Figure 5. Signal sequence deletions and wholly or
partially restored revertants (Bankaitis et al.; Emr
& Silhavy, 1983). The h-region is in boldface. The
percentage values refer to the amount of correctly
exported protein relative to wild-type (wt).



fall in the region -7 to -14 (Fig. 4), except for a
Leu→Pro mutation in malE. Non-functional
deletions and partially or totally restored
revertants are shown in Figure 5; again it seems
that an intact h-region between residues -7 and
-14 (counting from the cleavage site) is required
for proper functioning. Exceptions are the
functional Arg or Asp mutations in lamB,
the only indications so far of position-specific effects
in the h-region.

Taken at face value, the results presented here
suggest that the n-, h- and c-regions that together
make up the signal sequence are under different
selective pressures, although they are by and large
similarly selected in eukaryotes and prokaryotes.
The c-region does not contribute to the variation in
overall length; it extends, with small variations,
from residue -1 to -5 (in eukaryotes) or -6 (in
prokaryotes), and it follows the "(-3, -1)-rule" for
defining the cleavage site (von Heijne, 1983, 1984a).

The n-region, on the other hand, is extremely
variable both in terms of length and amino acid
composition, but its net charge, which is always
positive with a mean value of about +1.7, does not
vary appreciably with its length.

The h-region, finally, still presents something of
an enigma. The statistics on the wild-type
sequences and the results from mutation studies
suggest that residues -7 to -14 (in prokaryotes) or
-6 to -13 (in eukaryotes) are the most important
ones and constitute what seems to be a "minimal"
h-region. Overall hydrophobicity seems to be the
one governing principle in this region, and, indeed,
substitutions of more hydrophobic for less
hydrophobic residues seem to make a big difference
in h-regions that are close to the minimal length
(Fig. 5): one Ser, Gly, Thr or Pro can obviously be
tolerated in a "minimal" h-region, but not two. It is
uncomfortably true, however, that the lamB Gly
mutations do not fit this picture without additional
assumptions.

Thus, we are left with a picture of the "minimal"
signal sequence as one composed of a five
(eukaryotes) or six (prokaryotes) residue long
c-region; a seven (eukaryotes) or eight (prokaryotes)
residue long h-region with at most one Ser, Gly, Thr
or Pro among the hydrophobic residues; and a one
(eukaryotes) or two (prokaryotes) residue long,
positively charged n-region. Thus, it seems that all
three regions can be one residue shorter in
eukaryotes, making the shortest eukaryotic
sequences three residues shorter than the shortest
prokaryotic ones (cf. Table 1). If the n-, h- and
c-regions are indeed independent, it should be
possible to make a functional 13-residue long
eukaryotic signal sequence (cf. Fig. 3, line a) and a
16-residue long prokaryotic one. If the regions are
allowed to overlap slightly, even shorter sequences
may be possible (see Fig. 3, line p for an example of
h/c overlap).
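
The arithmetic behind the 13-residue and 16-residue figures can be checked with a short sketch. The region lengths are the ones stated in the text (n = 1, h = 7, c = 5 for eukaryotes; n = 2, h = 8, c = 6 for prokaryotes); the class and function names are hypothetical, and the h-region check covers only the "at most one Ser, Gly, Thr or Pro" criterion, not hydrophobicity as such.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MinimalSignal:
    n_len: int  # positively charged n-region
    h_len: int  # hydrophobic h-region
    c_len: int  # c-region preceding the cleavage site

    @property
    def total(self) -> int:
        """Overall length if the three regions do not overlap."""
        return self.n_len + self.h_len + self.c_len


EUKARYOTIC = MinimalSignal(n_len=1, h_len=7, c_len=5)
PROKARYOTIC = MinimalSignal(n_len=2, h_len=8, c_len=6)


def h_region_tolerated(h_region: str, max_small_polar: int = 1) -> bool:
    """True if the h-region holds at most one Ser, Gly, Thr or Pro."""
    return sum(h_region.count(r) for r in "SGTP") <= max_small_polar
```

Each region being one residue shorter in eukaryotes gives the three-residue difference between the two totals (13 versus 16).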

The maximal limits are harder to find. It seems
clear that the c-region cannot be much longer than
its "consensus" length of five or six residues.
Likewise, if the h-region becomes longer than about
20 residues it may in fact anchor the protein
permanently to the membrane and turn into an
N-terminal trans-membrane sequence (cf. von
Heijne, 1981; Bos et al., 1984). The limits on the
n-region, finally: 18 residues have been attached to
the N terminus of a cloned insulin gene, making the
n-region 21 residues long, with no effect upon
export (Talmadge et al., 1981); on the other hand, a
mutant Sindbis virus glycoprotein with its N
terminus fused to a 30,000 Mr cytoplasmic protein
is not exported (Wirth et al., 1979). The exact limit
remains unknown, however.

What, then, are the functions of these regions?
Structurally, they appear to be quite independent,
and they do not seem to be co-selected in any
important way. The c-region is clearly involved in
defining the cleavage site (von Heijne, 1984a); the
h-region has been suggested as being the target for
the "signal recognition particle" (SRP) which
imposes a translational block on cytoplasmic
ribosomes synthesizing proteins destined for export
(Walter et al., 1981); and the n-region may have
something to do with the "docking protein"
catalysed release of the SRP-induced block (Hall et
al., 1983; von Heijne, 1984b). In contrast to the
universally conserved "(-3, -1)-design" of the
cleavage site (von Heijne, 1983), however, no
specific patterns of amino acids have yet been
detected in the rest of the signal sequence, and the
h- as well as the n-region seem far too variable, both
in terms of length and amino acid sequence, to
allow for specific protein-protein interactions such
as those envisioned in the current models of how
SRP works. Instead, both regions seem well-suited
for binding in a rather unspecific way to the surface
(n-region) and to the interior (h-region) of
membranes, as suggested repeatedly in the
literature (DiRienzo et al., 1978; von Heijne &
Blomberg, 1979; Wickner, 1980; Engelman &
Steitz, 1981).

The length variations observed in the h-region
(~6 to ~20 residues) may indicate that this region
spans the 25 to 30 Å thick non-polar interior of the
membrane as a structure composed partly of
α-helix, partly of extended chain depending on its
length: eight residues in a fully extended structure
will have a length of some 27 Å, close to the length
of a 20-residue long helix. Since the helical
conformation should be thermodynamically
preferred in a non-polar environment, h-regions of
intermediate length would span the membrane
as part α-helix, part extended-chain structures (cf.
Bedouelle & Hofnung, 1981).
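
The length estimates in this argument can be reproduced with a small calculation. The sketch below assumes a rise of 1.5 Å per residue along an α-helix and about 3.4 Å per residue for a fully extended chain (the latter chosen because it gives roughly 27 Å for eight residues, as in the text); the function names and the 14-residue example are hypothetical.

```python
HELIX_RISE = 1.5     # Å per residue along an α-helix axis
EXTENDED_RISE = 3.4  # Å per residue, assumed for a fully extended chain


def span(n_helix: float, n_extended: float) -> float:
    """Length in Å of a segment that is part helix, part extended chain."""
    return n_helix * HELIX_RISE + n_extended * EXTENDED_RISE


def helical_residues_to_span(n_residues: int, thickness: float) -> float:
    """Number of helical residues for which n_residues span `thickness` Å,
    the remainder being extended; clamped to the range 0..n_residues."""
    h = (EXTENDED_RISE * n_residues - thickness) / (EXTENDED_RISE - HELIX_RISE)
    return min(max(h, 0.0), float(n_residues))


extended_8 = span(0, 8)    # eight residues, fully extended
helix_20 = span(20, 0)     # twenty residues, fully helical
h14 = helical_residues_to_span(14, 28.0)  # a hypothetical intermediate case
```

Under these assumptions the two extremes come out at about 27 Å and 30 Å, and an h-region of intermediate length reaches the same span with a mixture of the two conformations.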

However, this does not explain the SRP-effect.
By substituting a polar leucine analogue for Leu in
a Leu-rich signal sequence one can bypass the SRP-
induced translational block and get a cytoplasmic
protein; with a Leu-poor signal sequence this does
not happen (Walter et al., 1981). It may be, though,
that this does not come about as a result of a direct
interaction between the signal sequence and the
SRP; rather, the relation between the two may be
indirect and not dependent upon a specific protein-
protein interaction. One might speculate that the
signal sequence somehow interacts with the
ribosome rather than with the SRP, and that the
SRP halts translation by interacting with the
ribosomal translocation site on ribosomes
"sensitized" by the presence of a signal sequence.

This work was supported by a grant from the Swedish
Natural Science Research Council.
The process by which proteins are transported across
membranes was presented as a detailed hypothesis, the 'signal
hypothesis', in 1975 (1) and subsequently in revised forms (2,3).
An essential feature is the presence of a polypeptide of 20-40
amino acids, predominantly hydrophobic, at the amino terminus of
secreted proteins. This binds to the endoplasmic reticulum and
transport is initiated before translation is complete. Despite
some contradictory evidence (4-6) the essentials of the
signal hypothesis have been widely accepted and have been
extended to include proteins which become integrated into
membranes (3,7) and bacterial systems (8).

The current hypothesis requires that the signal peptide
itself is transient. However, a number of secreted and
membrane-bound proteins have now been shown to have N-terminal
sequences which resemble the transient peptides, but which are
not cleaved. It has been shown that these sequences can
function as signals for translocation into the rough endoplasmic
reticulum (9,10). Therefore, they have been included in this
compilation.

Proteins which are transferred across two membrane bilayers
into mitochondria and chloroplasts have also now been found to
have transient amino terminal regions (3,11,12). A major
difference in the transport of these proteins is that it occurs
post-translationally and it is clear that their signals are
quite different.
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'V^PAtKDU HXVUTPnrlVX 



-iiiitSne p^tij^i * - perlpUdmU pr.i.it^; t - >3"'"V/:^'•^''' 
i , iranBm.fflUran. protein; 7 - AClH . 

.fc « . _ a*. » . ■ p ft* 1,4.1111 Mfli fta ■ _ 

riiiw^iirulVlniii; h. I. r Ji&li- tonim • Vj"^ 

tifturaRinitfaftc; N.J. Hew JtrMVi orn « orullhiw; Pf' " _ 

Symbols: X - unknown amino acid; A - Ala; C - Cys; D - Asp; E -
Glu; F - Phe; G - Gly; H - His; I - Ile; K - Lys; L - Leu;



A total of [?] sequences are presented here. A detailed
analysis of their structure is beyond the scope of this report.
However, their most striking similarities include their length,
their hydrophobicity and the limited number of amino acids which
occur at the carboxyl termini of the signals. The
tendency is for a signal to include a charged residue within the
first five amino acids, followed by a core of at least nine
hydrophobic residues which should be sufficient to span a
membrane. A helix breaking residue (glycine or proline) or a
large polar residue (notably glutamine) frequently occurs four
to eight residues before the cleavage site. The pattern of
amino acids near the cleavage sites of eukaryotic and viral
proteins has been discussed at length by von Heijne and a scheme
suggested for calculating the probability of processing after
each residue (2[?]). The most striking feature of cleavage sites
is the presence of amino acids with small, uncharged side chains
at the carboxyl termini of the signals. If only those where the
cleavage site is well characterised are considered, [?]% of 13[?] of
the eukaryotic signals end with alanine or glycine and a
further [?]% end with serine, cysteine or threonine. If 16 viral
signals are included, these percentages become [?]% and [?]%
respectively. For the 40 well characterised prokaryotic signals
[?]% end with alanine or glycine and a further 10% with serine,
cysteine or threonine.
Processing probabilities have been calculated for each
sequence in the compilation (data not shown). For 74% of the
eukaryotic and viral proteins the observed cleavage site has the
highest processing probability. For a further [?]% the site with
[...] are based on data from eukaryotic and viral proteins only.

However, for comparison, they were also calculated for the
prokaryotic sequences. 39% of the observed sites were in
complete agreement with the predictions. A further 1[?]% are
cleaved at the second most probable site. The process by which
prokaryotic proteins are exported is more complex than for
eukaryotic proteins since the correct targeting of proteins to
the inner or outer membranes or to the periplasmic space must
occur. Information in the mature protein sequences may be
responsible for their ultimate localisation ([?]).
Experimental evidence for the mechanisms involved is as yet
insufficient and in some cases is contradictory (30,[?]).

Some of the signal sequences are unusually long, notably
those of the Bacilli (3[?]-44 residues) and the viral envelope
glycoproteins ([?] and [?]4 residues). In the latter the most
hydrophobic regions are towards the carboxyl termini of their
signals; it may be that these represent 'internal' signals.
The signals responsible for targeting proteins to mitochondria
and chloroplasts are also long ([?] residues). They are very
different to the other signals presented. This is not
surprising since they must be clearly distinguished from
'export' signals and because their transport is initiated only
after translation is complete. It is not possible to describe a
general structure from the small sample available.